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09/018,248 filed February 3, 1998. This application also 
claims the benefit of the filing date of U.S. Provisional 
Application Serial Number 60/086,654 filed May 26, 1998. 

FIELD OF THE INVENTION 

This invention provides methods for DNA analog 
representation of vector operations, including vector 
addition, determination of inner and outer products of 
vectors, and of the product of a matrix and a vector, using 
negative as well as non-negative numbers. The methods of the 
present invention utilize the spectrum of biochemical 
activities and operations which DNA molecules are capable of 
undergoing, including base-specific Watson-Crick hybridiza- 
tion, ligation, polymerase extension, site-specific strand 
cleavage via restriction enzymes, melting of duplex DNA, 
cleavage of DNA by site-specific endonucleases, and degrada- 
tion of DNA by exonucleases of broad sequence specificity. 



Watson-Crick hybridization of complementary DNA oligomers 
makes possible a DNA analog representation of highly parallel 
operations [1, 2] . The present invention develops this 
potential and provides methods whereby DNA analog 
representation of the operations of vector algebra is used to 
produce a DNA-based neural network [3] which may be used in an 
associative or content addressable memory [4-6] and a DNA 
multilayer perceptron [7, 8]. 

BACKGROUND OF THE INVENTION 

All publications and patent applications referred to 
herein are incorporated by reference fully as though each 
individual publication or patent application was specifically 
and individually indicated to be incorporated by reference. 

Various strategies for finding solutions to mathematical 
problems have been devised which use sets of DNA 
oligonucleotides having selected length and sequence 
properties. For example, there are methods that use DNA 
oligomers of defined nucleotide sequence to solve a 
Hamiltonian path problem [1], a "satisfaction" problem [2] and 
for performing addition [9] and matrix multiplication [10] of 
non-negative numbers. Baum [11] has proposed using DNA 
operations akin to those described by Adleman [1] and Lipton 
[2] to produce an associative DNA memory of enormous capacity. 



Prior to the development of the methods of the present 
invention, methods for using DNA oligomers in analog 
representation of matrix multiplication that include use of 
negative numbers as well as non-negative numbers were not 
disclosed or taught. 

Adleman [1] first pointed out that Watson-Crick 
hybridization of complementary DNA strands makes possible a 
representation of highly parallel selective operations that 
could be a basis for molecular computation. In practice, 
small departures from the ideal selectivity of DNA 
hybridization can lead to undesired pairings of strands that 
create significant difficulties in implementing schemes using 
interactions of DNA oligomers to represent large scale Boolean 
functions. Recently, however, Deaton et al. [12] showed that 
it should be possible to find a large enough set of mutually 
non-hybridizing DNA strands to allow digital molecular 
computation of high complexity with tolerable error rates. 

A neural network is a physical system that models a 
simple biological neuronal system, in that it comprises a 
large number of interconnected processing elements, called 
neurons. The activity of a given neuron is determined by the 
weighted sum of all of the signals that the neuron receives 
from the neurons to which it is connected. In most neural 
network models, the total activity of the ith neuron, called a 



"perceptron^. " is 

Si = ^io + L ^ij^j 

where is the signal received from the jth neuron that is 
weighted by an amount w^^, w^.^ is a bias weight, and is usually 
negative. The ith neuron responds to incoming signals by 
itself sending a signal y = F(aO . The function F(aJ is a 
saturating function; a common choice is the non-linear 
logistic or sigmoid function, 

F(a) = (1 + exp(-a) ) 
which restricts the output to be between 0 and 1, and gives an 
approximately linear response for small levels of activity. 
Thus, the activity of the ith perceptron is positive when the 
sum of the incoming weighted signals is larger than the 
negative bias weight; and when the incoming signal is 
sufficiently large, the output of the ith perceptron is 
approximately 1 (see, for example, W. Penny et al., page 38 5- 
387, in [8]). From the parallel operations and interactions 
of the neurons emerge collective properties that include 
production of a content-addressable memory which correctly 
yields an entire memory from any subpart of sufficient size 
[3] . Neural networks do not need the high precision 
associated with digital computing [3] . Because they are fault 
tolerant, such neural networks can be represented by DNA with 
the massive parallelism first envisioned by Adleman [1] . 



BRIEF SUMMARY OF THE INVENTION 

The present invention provides a method for DNA-based 
analog representation of the operations of vector addition and 
vector and matrix algebra, using negative as well as non- 
5 negative numbers, wherein a subset of all single-stranded DNA 

n-mers is in 1:1 correspondence with the basis vectors e. , i = 
l,2,...,m, in an abstract m-dimensional vector space; an m- 
component vector V in a space with basis vectors e^, i = 1 

through m, is represented by the equation V = 2^7^ e^, and its 

10 analog representation is a DNA sample containing strands or 

m their complement E., for each i = 1 through m, where the 

fU presence of E. or E^ is determined by the sign of the amplitude 

ivp V. of the ith component of the vector, and the concentration 

rg of each E^ or E^ is proportional to the magnitude of the 

IS amplitude V^. 

P The present invention further provides a method for 

implementing an analog neural network, wherein the data of the 
processing units, or neurons, is in the form of m-component 
vectors V = e^, each of which is represented by a set of 

20 the oligomers as described above. The interconnections and 

the transmission of signals between the neuronal units are 
represented by biochemical processes and reactions involving 
the oligomers E^ and E^; such processes and reactions include 



5 



diffusion, molecular recognition, and specific hybridization 
of complementary oligomers, and nucleotide sequence-specific 
reactions of nucleic acid-modifying enzymes acting on the 
oligomers, as occur in analog operations of vector addition 
and vector and matrix algebra. Application of a saturating 
function to a signal from one or more neuronal units to 
produce an output is represented by hybridization of a set of 
oligomers selected by said set of biochemical reactions to a 
complete, sub-stoichiome trie set of single-stranded E. and 
oligomers, and an output of the neural network is represented 
by a set of oligomers that specifically hybridize to said sub- 
stoichiometric set of E^ and E. oligomers. 

In a specific embodiment, an analog content addressable 
memory is produced by representing elements of memory as m- 

component vectors V = ©i/ wherein items of experience, a 

set of vectors Vi"", are stored in memory by forming the outer 
product over all the experience vectors for i^j : 

Ti: = La V,^ V.^ ; 
wherein recall of a particular experience V^*^ imperfectly 
represented as Ui"" is effected by the algorithm: 

Vi = S(X V. + U,^) ; 
where the function S(x) is a saturating function such as 

g • tanh ( x ) , 

with g being the small-signal gain; and wherein the saturating 



Figure 3 schematically represents steps in the 
determination of the inner product of V and W in which primers 
A and B are hybridized at their 3' ends to the 3' ends of the 
DNA strands of vector V or W, and the 3' ends of the primer 
strands are extended to produce DNA strands V or W that are 
complementary to the V or W template strands. 

Figure 4 schematically represents steps in the 
determination of the outer product of V and W in which a 
bridging linker oligomer aligns the 3' end of a V. strand and 
5' end of a W^{F} strand for efficient ligation to obtain a 
strand of the form {E.}{Ej}{F}. 

Figure 5 schematically represents steps in the 
determination of the product of vector X and matrix T^j, the 
outer product of V^Vj, in which DNA strands containing {Xj} are 
used to select {Vi} strands of the form {Ei} and {E^} 
representing an unchanged sign contribution to the product. 
The abbreviation RE stands for restriction enzyme. 



Figure 6 schematically represents steps in the 
determination of the product of vector X and matrix T^j in 
which DNA strands containing {X., } are used to select {V^} 



strands of the form {Ei} and {E^} representing a changed sign 
contribution to the product. 

Figure 7 schematically shows the "writing" of an image to 
memory. Following exposure of a DNA-saturated substrate, 
e.g., a DNA chip, to an image, with selective denaturation of 
double-stranded complexes at the image pixels, the oligomers 
that represent the image are collected. The image data is 
formed into an outer product, and the DNA strands representing 
all of the images to be stored are pooled to give the sum of 
the outer products, which is the memory matrix T^j of Eq. 2. 

Figure 8 schematically portrays retrieving a complete 
image from the memory matrix T^., starting with a set of DNA 
strands that represent U^, a vector containing only a very 
small part of the image. A saturating function is applied to 
the set of DNA strands representing the product of the memory 
matrix T^. and the incomplete vector to yield a set of DNAs 
representing Xi . Iterations of the method continue until a 
set of DNA strands is obtained that represents the complete 
image . 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention employs method steps in which 
vectors are represented by custom sets of DNA oligonucleotides 



and/or oligonucleotide analogs having selected subunit 
sequences, and in which vector operations, including vector 
addition and vector algebra, are represented in analog form by 
well known reactions and manipulations of the DNA oligomers 
representing the vectors, such as hybridization, ligation, and 
cleavage by nucleases, separation of single-stranded from 
double-stranded oligomers, and separation of short oligomer 
fragments from longer oligomers. Unless otherwise indicated, 
the present invention is practiced using conventional 
techniques of chemistry, biochemistry, and molecular biology, 
which are well-known and are within the capabilities of a 
person of ordinary skill in the art. (for example, see [13] - 
[18] ) . 

Unless defined otherwise, all technical and scientific 
terms used herein have the same meaning as commonly understood 
by one of ordinary skill in the art to which this invention 
belongs. Although any methods and materials similar or 
equivalent to those described herein can be used in the 
practice or testing of the present invention, the preferred 
methods and materials are described. 

Nucleic Acid Oligomers 

Oligomers are linear sequences of relatively few 
subunits. An oligomer having n subunits is referred to as an 
n-mer; for example, a nucleic acid oligomer that contains 12 

10 



or 17 nucleotides is referred to as a 12-mer or as a 17-mer, 
respectively. As used herein, the term oligomers refers to 
RNA or DNA oligonucleotides, RNA or DNA oligonucleotide 
analogs, or a combination of RNA and/or DNA oligonucleotides 
and RNA and/or DNA oligonucleotide analogs, which are used in 
representing vectors and matrices, operations involving 
vectors and matrices, including vector addition and vector 
algebra, and in implementing a neural network. The RNA or DNA 
oligonucleotide analogs employed for the present invention can 
be oligomers in which from one to all nucleotide subunits are 
replaced with a nucleotide analog to confer desired properties 
such as increased detectability , increased hybridization 
affinity, and resistance to degradation by a nuclease. Such 
oligonucleotide analogs include but are not limited to 
oligomers comprising 2'-0-alkyl ribonucleotides, 
phosphorothioate or methylphosphonate internucleotide 
linkages, peptide nucleic acid subunits, and nucleotides 
modified by attachment of radioactive or fluorescent groups, 
groups which intercalate, cross-link or cleave a nucleic acid, 
or groups which alter the electric charge or hydrophobicity of 
the oligomers. Methods for making and using oligonucleotides 
and oligonucleotide analogs such as those listed above are 
well known to those skilled in the art of making and using 
sequence-specific hybridizing oligomers . 



11 



In general, an essential characteristic of the oligomers 
employed in practicing the invention is that they are able to 
hybridize specifically to oligomers having complementary 
subunit sequences to form stable double-stranded complexes. 
The statement that an oligomer hybridizes specifically to 
another oligomer is intended to mean that a portion of a first 
oligomer comprising a nucleotide sequence complementary to a 
sequence in a second oligomer binds by Watson-Crick base- 
pairing to the complementary portion of the second oligomer to 
form a stable double-stranded complex, under hybridization 
conditions that are sufficiently stringent that oligomer 
molecules having fewer bases complementary to, or forming less 
stable duplex structures with, the second oligomer do not also 
hybridize to the second oligomer and form stable double- 
stranded complexes. Selection of parameters such as the 
lengths of the complementary portions of the different 
oligomers and the conditions used in hybridization and wash 
steps, so that the oligomers hybridize specifically to their 
counterparts, is well within the capabilities of a person of 
ordinary skill in the art (e.g., see chapter 11 of [13]). 

The sizes of the oligomers employed in practicing the 
present invention can range from about 4 subunits to 100 or 
more subunits in length. One skilled in the art would 
appreciate that in order for the oligomers to hybridize 
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specifically to form stable double-stranded complexes, the 
oligomers representing basis vectors should be at least about 
6-8 nucleotides in length. Depending on the complexity of the 
data being represented, the basis vectors can be represented 
by DNA n-mers of from about 8 up to about 20, 30, or 50 or 
more nucleotides; preferably about 10 to about 30 nucleotides. 
As discussed below, n-mers representing basis vectors 
preferably have about 6 to 12 additional nucleotides added to 
each end to stabilize hybridization of termini, to permit re- 
separation of oligomers joined end-to end, and to permit 
amplification by PGR using a common set of primers. Thus, DNA 
oligomers representing basis vectors are typically about 18 to 
about 74 or more nucleotides in length; preferably about 20 to 
about 50 nucleotides in length. In the methods involving 
hybridization of basis vector oligomers to oligomers attached 
to a substrate, the basis vector oligomers 

can be longer, shorter, or the same length as the attached 
oligomers. In addition, basis vector oligomers having 
different lengths, chemical structures and properties, can be 
hybridized to different sites of the same oligomer array. 
Those skilled in the art appreciate that the specificity and 
affinity with which oligomers hybridize to each other are 
determined, in large part, by the length, nucleotide sequence, 
and chemical structure of the oligomers, and so are able to 
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select structural parameters of the oligomers employed in the 
present invention that are appropriate for their intended use. 
For example, the subunit sequences of the different basis 
vector oligomers and of oligomers attached to solid substrates 
can be selected so that the oligomers do not comprise self- 
complementary sequences that stabilize folding of said 
oligomers into hairpin structures which interfere with 
formation of inter-strand duplexes. Additionally, the subunit 
sequences of the oligomers can be selected so that the melting 
temperatures (Tm) of the double-stranded complexes formed by 
hybridization of the complementary portions of the basis 
vector oligomers and the attached oligomers at the different 
sites of the array are all within a selected temperature 
range, e.g., in the range of a selected Tm plus or minus about 
5 degrees C, for more efficient control of oligomer 
hybridization and release. 



Obtaining the Oligomers 

The present invention employs multiple sets of large 
numbers of different oligomers, for example, DNA 
oligonucleotides, having specified lengths and nucleotide 
sequences. The oligomers of the present invention can be made 
by well-known methods that are routinely used by those skilled 
in the art of synthesizing oligonucleotides and/or 
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oligonucleotide analogs (for example, see [16, 17, and 19]). 

An Oligomer Storing Device 

In the preferred embodiments of the invention, the 
required oligomers are synthesized and stored in an oligomer 
storing device, from which they are released as they are 
needed. A preferred oligomer storing device comprises a 
substrate supporting an array of oligomer storage sites, or 
depots, each of which comprises a surface to which are 
attached oligomers having a selected subunit sequence. The 
oligomers are stored in the depots of the storing device by 
allowing them to hybridize by Watson-Click pairing to the 
oligomers attached to the surfaces of said depots to form 
double-stranded complexes. When oligomers having a particular 
nucleotide sequence are needed, they are released from the 
oligomer storing device by locally denaturing the double- 
stranded complexes of the depot in the storage array where 
they are stored, e.g., by localized heating, without 
denaturing double-stranded complexes of the depots storing 
other oligomers, and the desired oligomers are collected from 
the solution in contact with the oligomer storage array. 

The substrate supporting the array of oligomer storage 
sites, or depots, can have a flat surface that supports the 
array, or it can be distributed in three dimensions, such as 
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in a gel, a fibrous or granular matrix, or in a porous solid. 
A substrate which is suitable for supporting immobilized 
nucleic acids for hybridization analysis can, in general, be 
adapted for use as an oligomer storage device of the present 
invention. Accordingly, a variety of different designs and 
materials are available for preparing the oligomer storing 
device of the present invention. For example, the storage 
device may be a flexible filter, e.g., of nylon or 
nitrocellulose, or it may be of a rigid material such as 
silica, silicon, glass, crystalline AI2O3 ("synthetic 
sapphire"), beryllium oxide, or a solid substrate coated with 
a noble metal such as gold. Methods for making such substrate 
supports for hybridizing oligomers are well known to those 
skilled in the art (see [20] col. 6, lines 1-39; [21] col. 11, 
lines 49-63; [22] col. 9, line 10, to col. 32, line 7; [23] 
pages 40-42; [24] pages 114-128; [25] pages 607-609, and 
[26] ) . The storage device may also include a chamber or 
container housing the substrate, through which oligomer 
hybridization, wash, and collection solutions are directed, 
for convenience of use similar to that of a chromatography 
column . 

The term depot refers to a site at which oligomers are 
stored, and comprises a delimited area or volume that is part 
of or is attached to the supporting substrate, to the surface 
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of which are attached hybridizing oligomers comprising a 
selected subunit sequence (for example, see (2) in Figures 1 
and 2) . A depot site can have any size, shape, or volume, 
consistent with the objective of the invention of storing and 
selectively releasing oligomers as needed. By array is meant 
an arrangement of locations, or depots, in or on the 
supporting substrate of the oligomer-storing device. The 
depots can be arranged in 2- or 3- dimensional arrays, or 
other matrix formats. Figure 1 shows a 2-dimensional 4x5 
array of depots on a supporting substrate. The number of 
depots in the array can range from 2 to 10' or more. It is 
within the knowledge of those skilled in the art to fabricate 
a rigid substrate supporting an array of oligomer depot sites 
that can range in diameter from about 1 micron to 1 centimeter 
or more, and an array of depot sites of 5-10 microns in 
diameter can readily be made with an array density of about 
10^ depot sites per cm^ (see [20] col. 8, lines 50-68; [21] 
col. 9, lines 10-18; and [23] page 40). All of the depot 
sites of a given array can have the same diameter, or a single 
depot array can comprise depot sites having different 
diameters. Preferably, the oligomer storing device stores 10^ 
to 10"^ different types of oligomers of about 6 to about 100 
subunits in length in a micro-array of thermally isolated 
depot sites on a rigid substrate. 
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The oligomers attached at the depot sites can be attached 
directly to the surface of the substrate, or to the surface of 
a pad or pedestal-like structure that is itself attached to 
the substrate, and which is made of material that is the same 
as, or different from, that of the substrate. The depot 
surface to which the oligomers are attached can be located on 
a raised feature or in a well-like depression on the surface 
of the supporting substrate. Figure 2 shows oligomers 
attached to a depot site (2) comprising a raised pad 
comprising three different layers ((10), (11), and (12)) 
affixed to a rigid transparent substrate (1) . 

Methods for making arrays comprising oligomers attached 
at depot sites to produce oligomer-s toring devices for the 
present invention are well known. Such methods include in 
situ synthesis of oligomers attached at their 3' ends to a 
functionalized surface such glass, Si02, or GaAs (for example, 
see [20] col. 4, line 67 to col. 10, line 35; [21] col. 23, 
line 3, to col. 25, line 18; and [22] col. 17, lines 21-63). 
Alternatively, pre-synthesized oligomers can be chemically 
attached to the substrate, e.g., by derivatizing the oligomers 
or the attachment surface, and then depositing microdroplets 
of the oligomers at the appropriate depot sites and allowing 
the oligomers to react with the depot site surface, or by 
attaching biotinylated oligomers to a streptavidin-coded 
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surface (see [20] col. 1, line 18 to col. 3, line 13 and col. 
6, line 21 to col. 10, line 35; [25] pages 607-609; [27] col. 
13, lines 2-9; and [28] pages 27-29). Preferably, tlie 
oligomers are attaclied to the depot sites througli uncharged 
spacer groups ((6) in Figure 2) that tether one end of each of 
the oligomers to the depot surface ([22] col. 11, line 49, to 
col. 13, line 45; and [29] pages 5022-24), since the use of 
such spacer groups is known to increase hybridization 
efficiency ([28] page 29). 

Storing Soluble Oligomers in a Depot Array 

Oligomers are stored in the depot array of a storage 
device by allowing them to hybridize specifically to oligomers 
comprising complementary subunit sequences which are attached 
at the depot sites ((2) in Fig. 2), to form double-stranded 
oligomer complexes attached to the depot sites ((7) in Fig. 
2) . Those skilled in the art recognize that the number of 
consecutive complementary nucleotides that must be present in 
an oligonucleotide so that it hybridizes specifically to a 
target nucleic acid molecule can vary considerably, from about 
4 up to 14 or more, depending on such factors as the 
complexity of the set of target nucleic acids and the physical 
conditions (ionic strength, temperature, anionic and cationic 
reagents, etc.) used in the hybridization and wash steps. A 
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complete set of oligonucleotides comprising every possible 
sequence of n consecutive nucleotide subunits can be stored in 
an array of 4-^ depot sites comprising complementary oligomers 
by exposing the array to a solution containing the soluble 
oligomers at a sufficiently low temperature, in a suitable 
buffer containing a high molar concentration of a monovalent 
cation such as Na\ The time required to saturate the 4" depot 
sites with the 4" different n-mer oligomers depends on the 
concentrations of the oligomers, the temperature, and the 
concentration of Na^ ions. If the soluble oligonucleotides 
are applied at a nucleotide concentration of 0.5 mole per 
liter under conditions favorable for hybridization, the time 
for half of the hybridization reaction to be completed is 
about 36 seconds for n = 10, and about 16 hours for n = 15 
[30-33] . 



Releasing Selected Oligomers 

A custom set of soluble oligomers of known composition 
is obtained by locally denaturing double-stranded complexes of 
selected depots of the intact array comprising the desired 
oligomers, and collecting the oligomers released from the 
selected depots ((5) in Fig. 2) into the buffer solution in 
which the array is immersed ((8) in Fig. 2). Denaturation of 
oligomer complexes at selected depots can be achieved by any 
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of the nucleic acid-denaturing treatments known to those 
skilled in the art of nucleic acid biochemistry. Those 
skilled in the art appreciate that the melting temperature of 
a double-stranded oligonucleotide complex is dependent on the 
length, nucleotide sequence, and chemical structure of the 
complex, and on the ionic strength and chemical composition of 
the solvent (see [13] page 11.46). 

The preferred method for denaturing double-stranded 
complexes at the selected depots to release the desired 
oligomers is by locally heating the selected depots so as to 
subject the selected depots to a raised temperature under 
appropriate solution conditions for a period of time 
sufficient to release the desired oligomers from the selected 
depots. Localized heating of the selected depot surfaces can 
be achieved by any suitable means in accord with the structure 
and size of the supporting substrate, and the size and 
disposition of the individual depot sites. For example, 
selected depots can be locally heated by illuminating the 
surface of the array, in a suitable buffer and at a 
temperature below the melting point of the oligomer duplexes, 
with a pattern of focused irradiation from a radiant energy 
source ((4) and (9) in Fig. 2), e.g. an argon laser, that 
heats only those depots storing the desired oligomers. The 
laser can be mounted on a support which provides precise x-y 
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translation control, to permit controlled heating of one depot 
at a time, in serial fashion. Alternatively, the laser can 
have a broad beam that can irradiate a mask, the image of 
which can irradiate all of the depots in the array at once. 
The mask is used to shield the unselected depots so that only 
those depots containing the desired oligomers are heated. To 
heat a single depot having a surface area of about 100 pm^ to 
about 70°C in a suitable buffered solution so as to locally 
melt double-stranded DNA duplexes stored at the heated depot 
will require roughly 10 milliwatts of argon laser light (488 
nm) . In the preferred method, a substrate which is 
transparent to argon laser light, e.g. crystalline AI2O3, is 
used to support thermally isolated, light-absorbing, depot 
surfaces to which the oligomers are attached, allowing back 
illumination of the desired depots as shown in Figure 2, and 
protecting the oligomers from direct exposure to the laser 
radiation. A substrate of AI2O3 is also suitable because its 
relatively high thermal conductivity permits the substrate to 
act efficiently as a heat sink, drawing heat away from the 
irradiated depot sites and so providing greater thermal 
isolation of the unselected depot sites. Alternatively, the 
storage device substrate comprising the depot array could be 
in contact with, or have integrated within it, a controllable, 
addressable, array of resistive heating elements which is 
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spatially aligned with the depot array, so that application of 
current to selected resistive heating elements locally heats 
selected depots proximal to the activated heating elements to 
release the desired oligomers. Heller et al. teach 
fabrication of a silicon substrate into which is integrated a 
micro-array of electronically addressable micro-locations 
corresponding to a micro-array of DNA storage sites ([22] col. 
9-10, 12-16) . Accordingly, it is within the knowledge of 
those skilled in the art of microlithography and thick film 
circuitry to fabricate a DNA chip in which there is integrated 
an array of electronically addressable micro-locations 
comprising resistive heating elements such as can be formed, 
for example, by depositing undoped polycrystalline silicon at 
positions between addressable conducting wire grids [34] . As 
described by Heller et al., metal contact pads along the 
outside perimeter of the chip permit the wiring of such a chip 
comprising an integrated, electronically addressable, micro- 
array to a microprocessor-controlled power supply and 
interface for controlling the device ([22] col. 12). The 
amounts of oligomers released by localized heating can be 
controlled by varying the amount of heat applied, e.g., by 
controlling the intensity of the laser light or the 
temperature of the resistive heater, and/or by varying the 
time period during which heat is applied. According to the 
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preferred method, the localized heating of selected depots to 
release desired oligomers stored therein is electronically 
controlled by a programmable microprocessor and an interface 
for controlling the process. Local heating of selected depots 
will cause oligomer duplexes at the heated depots to melt in a 
short time, of the order of seconds, to yield single-stranded 
oligomers in quantities related to the time and extent of 
heating . 

Heller et al. teach that denaturation of DNA at selected 
depots can also be induced by locally increasing the negative 
electric potential at the selected depots ([22] column 20). 
In addition positively charged chaotropic agents and other 
denaturants can be added to the solution in contact with the 
selected depots to promote denaturation of the attached 
double-stranded complexes. Exposure to denaturing solution 
conditions can be limited to the depots selected for 
denaturation by surrounding the selected depot surfaces with a 
liquid-impermeable barrier that prevents the denaturing 
solution from contacting non-selected depot surfaces. For 
example, individual depots of a large-scale array, in which 
depot surfaces are 0.1 to 10 mm or more in diameter, can be 
situated in wells or surrounded by raised divider walls to be 
fluidically isolated from each other, so that selected depot 
surfaces can be exposed to denaturing solution without also 
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exposing non-selected depot surfaces to the denaturing 
conditions. Denaturation of selected depots^ whether by 
localized heating, application of increased negative 
potential, denaturing solution, or any combination of these 
means, can be carried out serially, one depot at a time, or in 
parallel with multiple depots being treated simultaneously. 

Collecting the Released Oligomers 

Oligomers released from selected depot sites following 
denaturation of double-stranded complexes at those sites ( (5) 
in fig. 2) are collecting by collecting the solution in 
contact with the treated depot surfaces ((8) in Fig. 2). The 
solution in contact with the oligomer-stor ing depot array can 
be enclosed or contained within a reservoir, and once the 
desired oligomers are released into the solution, it can be 
collected by any suitable means, e.g. by a manually operated 
or automated pipetting device, or a syringe. Alternatively, 
the solution containing the desired oligomers can be removed 
from the reservoir and transferred to a suitable collecting 
device, and fresh solution can be added to the reservoir in 
its place, e.g. to wash away residual oligomers in preparation 
for releasing a different set of oligomers, by using automated 
or microprocessor-controlled pumps that direct the flow of the 
different solutions through tubes connected to the reservoir. 
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REPRESENTING VECTOR OPERATIONS WITH DNA OLIGOMERS 

The DNA-based analog methods of the present invention 
permit one to carry out operations of vector addition and 
vector and matrix algebra, including determining the inner and 
outer products of two vectors, determining the product of a 
matrix and a vector, and determining the product of two 
matrices . 

Vectors 

Any m-component vector V in a space with basis vectors 
e^, 1=1, 2,..., m is represented by the equation 

V = Si (1) . 

A subset of all single-stranded DNA n-mers is selected to be 
in 1:1 correspondence with the basis vectors e^, 1=1, 2,..., m 
in an abstract m-dimensional vector space. The analog 
representation of V is then a DNA sample containing strands 
Ei, with the concentration of each strand [EJ being 
proportional to the amplitude of the i-th component of the 
vector. For example, a typical 10-mer, E. = 5 * AGCTATCGAT 3\ 
can be associated with the basis vector e^ identifying one 
direction in a space of at most 4^° = 10^ dimensions. The 
analog accuracy of representing a vector V in this manner will 
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be limited by Poisson fluctuations in the numbers of molecules 
in a finite sample volume; roughly 1 pmole of DNA oligomers 
would represent a random vector for m=10^ with roughly 1% 
errors in the individual amplitudes. 

Since DNA concentrations are always positive, an 
appropriate representation for negative amplitudes is needed 
[10] . In the present invention, negative amplitudes 
associated with unit vectors e. are represented by DNA 
oligomers having a nucleotide sequence complementary to E^. 
As a result, when two vectors are added, any positive and 
negative amplitudes will hybridize, and the resulting double- 
stranded DNA oligomers can be removed from the set of single- 
stranded DNA molecules; for example, by digestion with a 
suitable enzyme, or by column separation. 

The nucleotide sequences of the DNA oligomers of the 
present invention, and the conditions for their reaction, are 
selected to optimize interactions between the DNA oligomers 
that are analog representations of the vector and/or matrix 
operations of interest. For example, the n-mers can be 
synthesized with an invariant r-mer R^Rs ...R^ at their 5' 
ends, and an invariant r-mer S^S. ...S,, at their 3' ends, with 
r independently being about 3-6, to assist in hybridization 
reactions involving n-mer termini which are employed in 
representing operations such as determination of the inner 
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product of vectors V and W, as described below. Additionally, 
the DNA n-mers can be synthesized to have one half of a 
palindromic restriction enzyme recognition sequence at each 
end, to permit cleavage that separates pairs of strands that 
have been joined end-to-end for operations such as 
determination of the product of a matrix and a vector, also as 
described below. To prevent introduction of error into the 
operations by undesired interactions between DNA oligomers 
which are not fully complementary, the nucleotide sequences of 
the DNA n-mers are preferably selected so that the DNA n-mers 
are non-palindromic, relatively free of hairpin effects, and 
have minimal overlap with the other basis vectors [12]. 

A suitable choice for a set of n-mers having structures 
which are useful for the present invention is a set of single- 
stranded (q + 2r + 6)-mers of the form: 

Ei = 5' TAG R1R2 . . . Rr N/N2-N3^ . . . S1S2 . . - S, GTA 3* (2), 

where the core sequence Ni^ . . . N.^ . . . N^^ of length q nucleotides 
is associated with the basis vector . When such strands are 
joined end-to-end, the nucleotides at the junction form a 
palindromic restriction enzyme recognition sequence having the 
sequence 5' GTATAC 3', which, when hybridized to its 
complement to form a double-strand, is cut in the center by 
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the restriction enzyme Bstll07 I [35]. Negative amplitudes 
associated with unit vectors are represented by the 
sequence of bases complementary to E^^ e.g. 

E, = 5' TAG . . . S2S:N^^ ... N3%'Ni^ R, . . -RsE: GTA 3' (3). 

Since each nucleotide of the core q-mer is chosen from 
the four bases A, T, the number of such sequences is 4"^. 

If 4"^ >> m^ it is possible to select a subset of q-mers which 
will give a set of basis vector strands that are non- 
palindromic^ relatively free of hairpin effects and have 
minimal overlap with the other basis vectors [12]. 

Addition of Vectors 

Addition of two vectors can be carried out as follows: 
(I) Equal quantities from the two collections of DNA 
representing the two vectors, at twice the standard 
concentration, are combined under solution and temperature 
conditions that allow only fully complementary n-mer strands 
to hybridize to form stable double-stranded DNA complexes. 
Positive and negative contributions to the concentration of 
oligomers corresponding to any given basis vector hybridize to 
yield double-stranded DNA with blunt ends. After the reaction 
is complete, the positive and negative type DNA n-mers for 
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each basis vector which remain as single-stranded oligomers 
represent the sum of the two vectors. 

(II) The double-stranded DNA oligomers are separated from 
the single-stranded DNA n-mers of the same length by a high- 
performance liquid chromatography (HPLC) purification step. 
Alternatively, double-stranded DNA oligomers can be separated 
from the single-stranded ones by digesting the DNA with an 
appropriate enzyme such as Exonuclease III (E. coli) , that 
cleaves double-stranded DNA but not does not cut single- 
stranded DNA [36] . Following digestion of the DNA with 
Exonuclease III, the reaction mixture contains the single- 
stranded DNA oligonucleotides, plus unwanted DNA fragments 
that are significantly shorter than the DNA strands 
representing the vectors. The set of intact single-stranded 
DNA oligomers is then purified; for example, by HPLC, or by 
gel electrophoresis and elution of the DNA from the gel. The 
set of intact single-stranded DNA oligomers obtained after 
mixing the two DNA sets and removing the double-stranded DNA 
molecules as described above is an analog representation of 
the sum of the two vectors. The sum of any number of vectors 
can be taken simultaneously in the same manner. 

Multiplication of any of the individual vectors by a scalar is 
represented by adjusting the concentration of the DNA 
molecules corresponding to that vector. 
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Inner Product of Two Vectors 

The inner product of two vectors can be found as 

follows : 

(I) Three separate samples of each of the two collections 
of DNA n-mers representing the individual vectors and are 
obtained . 

(II) A first pair of samples of the and vectors is 
combined and the rate of hybridization, R_, is measured. The 
value of R_ is proportional to the time rate of increase of V- 
W duplex strands representing quantities of opposite sign. 
The individual contributions to R_ are proportional to the 
product of the concentrations of the two V and W strands^ and 
hence are proportional to the inner product. The 
concentration of double-stranded DNA is measured, e.g., by 
treating the DNA mixture with a fluorescent dye such as 
ethidium bromide that intercalates the double helix. The DNA- 
ethidium bromide complex fluoresces at 590 nm when excited by 
300-nm light, and thus provides a quantifiable signature for 
the concentration of double stranded DNA. 

(III) A V sample and a W sample are each treated to modify 
the 3' ends of the strands so that additional nucleotides 
cannot be attached in a polymerase-catalyzed extension 
reaction. For example, dideoxynucleotides can be added to the 
3' termini of the V and W strands in a reaction catalyzed by a 
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terminal transferase. Alternatively, a short oligomer having 
a non-extendable nucleotide (e.g., a dideoxynucleotide ) at its 
3' end can be ligated to the 3* ends of the V and W strands, 
using bridging linker oligomers, followed by separation of the 
modified V and W strands from the shorter oligomers. 

(IV) The modified V and W strands are each incubated 
separately with DNA polymerase in a suitable buffer and the 
two primers, 

5* AATGCAAGATCGAAATTTATACGTTTATCT TAG . , . S2S1 3' (A), 

and 

5' AATGCAAGATCGAAATTTATACGTTTATCT TAC R.R2 . . . 3 ' (B) , 

where 5* AATGCAAGATCGAAATTTATACGTTTATCT 3' exemplifies a long, 
inert strand that does not hybridize with any of the Vi or W. 
strands and form a stable double-stranded complex under the 
conditions used in the subsequent reaction steps. Of course, 
primer strands having other nucleotide sequences that do not 
hybridize with any of the V; or strands can be used instead 
of those shown above as (A) and (B) , with equal success. The 
3' ends of the long primer strands are extended on the V and W 
templates, producing the complements V and W to all the V and 
W strands present (Figure 3) . 

(V) The V and W complementary DNA oligomers produced by 
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extension of the long primers (A) and (B) are separated from 
the shorter V and W template strands by HPLC to yield the V 
and W strands . 

(VI) The third pair of samples of V and W DNA strands is 
used in combination with the V and W strands obtained in the 
previous step to measure the rate of hybridization R^. , V 
strands are mixed with and V with W, and the rate of 
hybridization R+ is measured for each reaction. The value R+ 
should be the same for each reaction; when the two rates may 
differ, e.g., due to effects of sequence-dependent oligomer 
structure on the average melting temperature of each strand 
mixture, the average of the two rates can be obtained and used 
as R+ . 

(VII) The inner product of the two vectors is represented by 
the suitably normalized difference of the rates, R+ - R_, where 
each rate R+ and R_ is suitably normalized to correct for 
concentration differences . 

Outer Product of Two Vectors 

The outer product matrix ViW. is formed by joining the 3' 
ends of the single-stranded DNA oligomers corresponding to V. 
to the 5' termini of the DNA oligomers corresponding to W. . 

(I) To ensure that the V. and Wj strands are attached to 
each other in the proper orientation, the 5' phosphate 
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residues are removed from the oligomers, e.g., using 
bacterial alkaline phosphatase, and the 5' termini of the Wj 
are phosphorylated, e.g., using bacteriophage T4 
polynucleotide kinase . 

(II) The Wj strands are further modified by ligating to 
their 3* termini a long, inert strand {F} that does not 
hybridize with any of the or Wj strands and form a stable 
double-stranded complex under the conditions used in the 
subsequent reaction steps. The modified strands and the 
{F} strands are ligated using bridging linker oligomers having 
the structure: 

5' F, ... F5F2F1 TAG . . . S2S: 3' (C), 

and 

5* F^ ... F3F2F1 TAG R:R2 . . . 3 ' (D), 

where h is about 6-9, such that the h nucleotides at the 5* 
ends of the bridging linker strands are complementary to the 
first h nucleotides at the 5* ends of the {F} strands. The 
bridging linkers (C) and (D) thus hybridize to the 3' terminal 
portions of the modified strands and the 5' terminal 
portions of the F strands and align them end-to-end for 
efficient ligation to obtain strands of the form {Ej}{F} and 
{Ej}{F}, which are purified from the shorter bridging linker 
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oligomers . 

(Ill) The modified V. and Wj{F} strands are then ligated 
together using the four types of bridging linker strands: 



5 5' SiS, . . . S, GTATAC S, . . . S^S^ 3' (E) , 

5' R, ... R.Ri GTATAC ... S^S^ 3' (F), 

5' R, ... R2R1 GTATAC R^R, . . . R, 3 ' (G) , and 

5* S1S2 . . . S, GTATAC R^R, . . . R, 3 ' (H) , 



10 which hybridize to the 3* terminal portions of the strands 

p and the 5' terminal portions of the W. strands and align them 

M end-to-end for efficient ligation to obtain strands of the 

m form {Ei}{Ej}{F}, {Ei}{E.}{F}, {E.KE^KF}, and {Ei}{Ej}{F} 

in (Figure 4). The number of ij strands is proportional to the 

1§^ product of the concentrations of the V: and W. strands and 

hence to the desired outer product. This is approximately 
true even if the reaction is allowed to run to completion, 
•l^i since there are so many different reactions competing for the 

same strands. DNA strands of the form EiE, will spontaneously 
20 form hairpins and may be removed by a first purification stage 

using gel electrophoresis or chromatography at room 
temperature. The remaining diagonal components of T:^, of the 
form EiEi and EiE. (e.g. T^, T22, T33 ...) may be removed if 
desired by allowing the strands representing Ti^ to hybridize 
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with an equal total concentration of strands representing a 
unit matrix 5^., having only diagonal components. The desired 
T^^ strands lacking any diagonal components may then be 
extracted by a second purification stage performed under 
temperature and solution conditions selected such that single 
duplex pairs of E^s melt, but double length hybridized 
segments corresponding to T^^ 6^^ remain in double-stranded 
form. 

Product of a Matrix and a Vector 

This method allows one to find the matrix inner product 
T^j Xj, given a matrix T represented by strands of the form 
{EJ{Ej}{F}, {Ei}{Ej}{F}, {Ei}{Ej}{F}, and {EJ{E.}{F}, and a 
vector X represented by strands of the form {E.} and {E^} 
having concentrations proportional to the amplitudes X. . The 
strands {Ei} and {Ei} of vector X will be referred to as {X^} 
to distinguish them from the strands of the matrix T, 

(I) Obtain a sample of the complement to {Xj}, which is 
{X^} . 

(II) To the 5' ends of both the {X^} strands and their 
complements, ligate DNA strands {G} which are about twice as 
long as the {F} oligomers ligated to the 3' termini of the 
matrix strands, and which are complementary in their 3' halves 
to the {F} oligomers. The {G} strands and the {X^} strands 
are ligated using bridging linker oligomers having the 
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structure : 



5 ' R, ... R2R1 GTA G;, ... G3G2Gi 3 ' ( I ) , 

and 

5' S: ... S2S, GTA Gh ... G3G2G, 3^ (J), 

where h is about 6-9, with the h nucleotides at the 3' ends of 
the bridging linker strands being complementary to the last h 
nucleotides at the 3' ends of the {G} strands. The bridging 
linkers (I) and (J) hybridize to the 3' terminal portions of 
the {G} strands and the 5' terminal portions of the {X.} or 
{Xj} strands and align them end-to-end for efficient ligation 
to obtain strands of the form {G}{Xj} and {G}{Xj}, 
respectively, which are purified from the shorter bridging 
linker oligomers . 

(Ill) One of the resulting two strand collections, {G}{X,} 
is incubated with a sample of the matrix strands and an enzyme 
with ligase activity, plus the set of (r+3)-mers: 



5^ TAG S. 



S2 S - 3 ' 



5 ' TAG R-R2 . . . R. 3 ^ 



(K) , and 
(L) . 



Ligation of the (r+3)-mers to the 3' ends of the {G}{Xj} 
strands that are hybridized to matrix strands results in 
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formation of double-stranded recognition sites for the 
restriction enzyme BstllO? I. The ligase is then inactivated, 
and the newly formed double-stranded restriction enzyme 
recognition sites are cut using BstllOV I enzyme, resulting in 
release of a set of {V^} strands from the matrix strands 
(Figure 5) . Purification of the short {V,} segments yields a 
collection of {V.} strands of the form {E.} and {E^} 
representing an unchanged sign contribution to the product. 

(IV) A second sample of the matrix is treated to modify 
the 3' ends of the strands so that additional nucleotides 
cannot be attached in a polymerase-catalyzed extension 
reaction. For example, dideoxynucleotides can be added to the 
3' termini of the V and W strands in a reaction catalyzed by a 
terminal transferase, as discussed above in the description of 
determination of an inner product. The modified matrix 
strands are then incubated with the other of the two strand 
collections, {G}{Xj}, and a DNA polymerase capable of primer 
extension, e.g., T4 DNA polymerase, in a buffer solution 
suitable for polymerase-catalyzed primer extension. 
Incubation results in the extension of the {Xj} strands at 
their 3' ends, using the {Ei} and {E^} strands at the 5* ends 
of the matrix DNAs as templates, to produce strands of the 
form {G}{Xj}{V.}. The strands are melted and the longer 
{G}{Xj}{Vi} segments are separated. These strands are then 
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hybridized with the set of four linker strands (E) , (F) , (G) , 
and (H) described above, and the resulting double-stranded 
restriction enzyme recognition sites are cut using Bstll07 I 
enzyme, resulting in release of a set of {V^} strands. The 
strands are separated, e.g., on the basis of their size, to 
yield a collection of {V.} strands of the form {Ei} and {E.} 
representing a changed sign contribution to the inner product 
(Figure 6) . 

(V) The {Vi} strands from step (IV) above are added to the 
{Vil strands obtained in step (III) above, using the 
previously described vector addition routine, to give a set of 
single-stranded DNA oligomers that is an analog representation 
of the desired matrix inner product. 

Product of two Matrices 

Oliver [10] describes an analog method for obtaining the 
product of two matrices containing only positive numbers, the 
disclosure of which is incorporated herein by reference. The 
method taught by Oliver can readily be extended to include 
negative numbers represented by complementary oligomers as 
described above for other vector and matrix algebra methods, 
and can be modified so that the product matrix has the same 
form as the starting matrices. However, for the neural 
network applications, this operation is not useful because the 
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nonlinear amplification does not commute with matrix 
multiplication . 

Input and Output Using a Hybridization Array 

Vector operations, including vector addition and vector 
algebra, represented by the interactions of selected sets of 
DNA strands as discussed above, are preferably carried out 
using DNA hybridization arrays to provide the sets DNA strands 
representing positive and negative vectors needed to solve a 
given problem, and to analyze the sets of DNA strands 
representing a solution to the problem. The preferred methods 
for "writing" vector data as sets of DNA strands, and for 
"reading" sets of DNA strands to derive the vector data they 
represent, employ DNA hybridization micro-arrays attached to 
one or more DNA chips. The heightened efficiencies with which 
DNA strands representing vectors can be provided and detected 
using DNA chips permit practical implementation of the methods 
of the present invention in making a DNA computer. 

A general method for "writing" digital data corresponding 
to an m-component vector V into analog form as a set of DNA 
strands and E^, wherein the DNA strands representing V are 
released from a DNA chip serving as an oligomer storing 
device, has been described above. Double-stranded DNA 
complexes of selected oligomer-storing depots in a micro-array 
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on a DNA chip are locally denatured, e.g., by localized 
heating, and the desired soluble oligomers are collected from 
the solution in contact with the storage micro-array (see 
Figure 2) . 

In the following examples, digital data in the form of an 
image of n x n picture elements, or pixels, is written, i.e., 
it is converted to analog information in the form of a set of 
DNA strands. The analog information is then read, i.e., it is 
converted back into image form. Although the example below 
demonstrates "writing" and "reading" image data, other types 
of data could just as readily be assigned to the vector DNA 
strands and interconverted from digital to analog form and 
back again in a similar manner. Since image data is being 
stored, the individual depot sites in the DNA micro-array are 
referred to as "pixels" in the following example. 

(A) A set of DNA strands and representing vector V 
that corresponds to digital data in the form of an image of n 
X n picture elements is provided as follows: 

A DNA chip is obtained which supports a micro-array of 
oligomer depot sites, or pixels, at which are anchored single- 
stranded DNA oligomers having unique nucleotide sequences of 
from about 10 to 100 or more nucleotides in length, where 
there is a 1:1 correspondence between the maximum image size 
of n X n pixels and the number of pixel sites in the micro- 
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array at which single-stranded oligomers are tethered, and 
where the micro-array includes a set of pixel sites at which 
are anchored single-stranded DNA oligomers that are 
complementary to the DNA strands of and E ^ . The pixel array 
supported by the DNA chip can comprise from about 10^ to 10^ or 
more pixels sites, at which are anchored as many different 
types of DNA oligomers. The pixels of the micro-array can 
have a diameter of from 4 to 50 /^m or more, and the density of 
the single-stranded DNA oligomers anchored at each pixel site 
is about 10^' DNA molecules per ^m^ . Saturating amounts of DNA 
strands complementary to the single-stranded DNA oligomers 
tethered to the pixels of the micro-array are then hybridized 
to the pixel array. 

(B) Each data image is flashed on the DNA micro-plate using 
focused radiant energy, e.g., from an argon laser, so as to 
cause local heating that melts a portion of the double- 
stranded DNA at a specific set of pixel sites that corresponds 
to the particular set of pixels which make the image. The 
single-stranded DNA oligomers released by melting of the 
duplex oligomers at each heated pixel yield a number n^ of E^ 
or Ei molecules proportional to the image intensity at that 
pixel. By calibrating the relationship between the amount of 
radiant energy applied and the number of DNA molecules of a 
given sequence that are released, several bits of analog 
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amplitude information about the image can be encoded into the 
concentrations of each of the E. or oligomers collected. 

(C) The micro-plate is washed and the set of DNA oligomers 
that was released by heating the selected pixels is collected. 
The total number of collected DNA molecules that represent a 
data image is N = n^. To subtract the average image 
intensity, as needed for orthogonality, add N/2 molecules 
randomly distributed over the complete set of E^, using the 
vector addition algorithm. In working with real images, it 
may be desirable to use a micro-array having only E^^ 
(positive) strands. The set of DNA strands can be amplified, 
e.g. by PGR , and then separated from the amplification 
primers, when it is desirable to have more copies of each DNA 
strand than are obtained directly from the micro-array by the 
above method. 

The inverse step of "reading" involves incubating a 
liquid sample containing the set of DNA oligomers that 
represents the data with a DNA chip supporting a pixel micro- 
array of single-stranded oligomers; i.e., a DNA array from 
which all non-covalently attached oligomers are stripped. The 
methodological steps and reaction conditions used for 
hybridizing the DNA oligomers representing the data to their 
complementary strands in the pixel array are essentially the 
same as those used for saturating the array with oligomers as 
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discussed above. The quantity of different DNA oligomers 
captured at each pixel of the micro-plate is proportional to 
the concentration of the DNA oligomers in solution, for a 
short enough exposure, and can be determined by exposing the 
micro-plate to a solution of ethidium bromide and detecting 
the fluorescence emitted by ethidium bromide bound to the 
double-stranded DNA molecules in the pixel array on the micro- 
plate surface. One skilled in the art would appreciate that 
the image corresponding to the set of pixels containing 
double-stranded complexes can readily be obtained by using 
other reporter molecules that emit a detectable signal 
indicating formation of double-stranded complexes in the 
array; e.g., a flourescent reporter group can be attached 
directly to each of the data oligomers prior to incubating 
with the read-out micro-plate (for example, see [20] col. 10- 
11; [22] col. 20 lines 40-51; [23] page 40; [28] pages 28-30; 
[29] page 5024; [37] col. 5, line 16, to col. 5, line 56; and 
[38]). Alternatively, other detection methods, such as mass 
spectroscopy, are Icnown and can be used for identifying the 
sites on a DNA chip that contain double-stranded complexes 
(for example, see [28] page 30]), 

Since the rate of hybridization of the soluble oligomers 
to their tethered complements is proportional to the 
concentration of the soluble oligomers, the set of soluble DNA 
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strands can be amplified, e.g. by PGR, prior to hybridizing to 
the chip in order to increase the rate of hybridization, and 
to give a stronger overall signal. 

IMPLEMENTATION OF A HOPFIELD NEURAL NETWORK 
Memorization and Recall 

The present invention also includes methods for DNA 
analog representation of a neural network that make practical 
use of the massive parallelism possible with nucleic acid 
computing [1, 2, 11]. The invention implements a neural 
network by using DNA oligomers having selected nucleotide 
sequences representing positive and negative vectors as the 
neurons; by letting diffusion, specific hybridization of 
complementary oligomer sequences, and nucleotide sequence- 
specific reactions of DNA-modif ying enzymes, as employed in 
the analog vector operations discussed above, serve as the 
connections and signaling between neurons; and by using 
hybridization to an array of single-stranded DNA oligomers 
having selected nucleotide sequences as a saturating function 
that gives, after one or more iterations, the output signal 
corresponding to the input activity [Penny et al. in [8], 
pages 386-387]. As an illustrative example, the present 
invention is described in its implementation as a DNA-based 
representation of a Hopfield neural network, which may be used 
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to make an associative or content addressable memory [3] . 
As described by Hopfield [3], elements of memory are 

represented as m-component vectors V = (Equation 1) . 

The items of experience, e.q,, data sets or images, 
represented by a set of vectors V^^ are stored in memory by 
forming the outer product over all the experience vectors for 
i^j : 



The condition T.^ = 0 is required along with T.j = T.^ for 
unconditional stability [39] . Recall of a particular 
experience V^^ imperfectly represented as U^*^ is effected by the 
algorithm 



where the function S(x) is a saturating function acting 
separately on each component of the vector ^ T^. V. + Ui'^. The 
statement that S(x) is a saturating function means that S(x) 
is one of class of functions S(x) that are monotonically 
increasing with x and that have a maximum and a minimum value, 
respectively, for large and small values of x. Such functions 
include, for example, tanh(x), x/sqrt (a+x^) , x/nth root of 



(4), 



V, = S T,^ Vj + Ui^) 



(5) , 
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(a+x") where n is even, and the step function S=-l for x<0, 
S=l for x>0. In practice, S(x) includes an additional term g 
that represents the small-signal gain; for example, S(x) = g • 
tanh(x). Solutions to Equation 5 are typically found by 
iteration starting with V. = 0, with the small gain g being 
adjusted to facilitate convergence. If the V^" are 
sufficiently different, i.e. are part of a nearly orthogonal 
set, the system will settle into a state closely resembling V^^ 
after a sufficient number of iterations of the method. 
Hopfield found that the number of memories that can be stored 
simultaneously is roughly 10% of the number of independent 
basis vectors. 

Each vector V =^ V.e: in the neural network can be 
represented by a set of single-stranded DNA oligomers; e.g., 
by a set of (q + 2r + 6)-mers of the form E^ (Equation 2) and 
Ei (Equation 3), representing positive and negative vectors, 
respectively, with concentrations [EJ proportional to the 
amplitudes V^, as described above, A Hopfield neural network 
can be implemented as a content-addressable memory by a series 
of steps leading to identification of a complete image 
represented by an experience vector V.*", given an imperfectly 
represented vector Ui'^ that accurately contains a sufficient 
fraction of the information of the complete vector V^*^. Using 
the methods of the invention, the complete V^^ vector can be 
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recalled, given as little as a few ten-thousandths of the 
information contained in V^'^. The number of iterations of the 
neural network algorithm needed to give convergence to the 
complete V,'" vector is reduced by using containing a greater 
portion of V,*^. 

The memory matrix T^^ defined in Equation (4) is the sum 
of all of the outer products V,^ V^^ of all the experience 
vectors V^^ for i^j, and can be represented by forming and 
pooling all of the outer product DNA strands of the form 
{Ei}{E^}{F}, {Ei}{E,}{F}, {Ei}{Ej}{F}, and {E,}{Ej}{F}, where 
i^^j, for each experience vector V.^ (see Figure 7). 

Given a vector Ui'^ that imperfectly represents a 
particular experience V,^, the complete experience vector V^^ 
can be recalled from the content addressable memory, as 
depicted in Equation 5, by finding a set of DNA strands X. 
corresponding to the inner product of the T^. matrix and the 
vector Ui^, and then implementing the saturating function 
S(XJ, by which a selected set of DNA strands representing the 
vector Xi is captured by hybridizing to a complete, sub- 
stoichiometric set of single-stranded Ei and E^ strands. The 
set of E. and E. strands used to apply the saturating function 
can be anchored to a solid support such as a set of beads, a 
polymeric matrix (e.g., in a chromatography column), or a 
silicon or AI2O3 chip, or they can be free in solution, e.g. 



48 



with each saturating oligomer being linked to a ligand or an 
additional oligomer to facilitate isolation of the set of 
strands selected by the saturating reaction. The 
unhybridized, single-stranded strands are then washed away 
or are otherwise separated from the double-stranded complexes 
formed by hybridizing the X^ strands to the set of saturating 
E. and Ei strands, and a set of oligomer strands representing 
the saturated X^ strands, S(X.), is obtained by denaturing the 
duplex molecules containing the S(Xi) strands, e.g. by 
heating, and collecting the selected set of single-stranded X^ 
oligomers released by the denaturing treatment. 

In a preferred embodiment, the single-stranded E^ and E^ 
strands used to apply the saturating function are tethered to 
form a hybridization array on a DNA chip. The X^ strands are 
hybridized to the sub-stoichiometric set of tethered Ei and E, 
strands, the hybridization array is then washed to remove 
excess X. strands, and a set of DNA strands representing the 
saturated X, strands, S(XJ, is obtained by heating the duplex 
molecules of the hybridization array containing the S(XO 
strands, and collecting the set of single-stranded DNA 
oligomers that are released by the denaturing treatment. 

Following collection of the S(Xi) strands, the steps 
corresponding to Equations (4) and (5) described above are re- 
iterated by again applying the saturating function to the set 
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of DNA strands corresponding to the inner product of the T^^ 
memory matrix and the saturated vector. The information 
(e.g., the image) represented by the set of oligomer strands 
S(XJ produced by applying the saturating function to in one 
or more iterations can be derived by letting the selected set 
of Xi oligomers hybridize to an array of complementary and 

strands attached to a DNA chip, and detecting the 
double-stranded DNA molecules hybridized to the tethered 
oligomer array, in the same manner as described above for 
"reading" a set of DNA oligomers representing the solution to 
a vector operation. For example, by labeling the double- 
stranded complexes with ethidium bromide, or by labeling the 
Xi oligomers with a fluorescing moiety before hybridizing to 
the DNA chip, and then by identifying the pixels that produce 
fluorescence upon illuminating the chip with light of the 
appropriate wavelength . 

To promote rapid convergence to obtain an image or data 
set consisting of binary data, i.e., of a series of Os and Is, 
it is preferred that the set of tethered E^ and E^ strands that 
make up the hybridization array be sub-stoichiometric relative 
to the set of complementary X^ strands, so that the saturating 
function serves to equalize the number of different E. and E^ 
strands in the set of DNA strands representing S(XJ released 
from the micro-array. Depending on the length of the query 
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vector, two or more iterations of Equations (4) and (5) are 
sufficient to force the mixture into a steady state answer to 
the query, which is attained when two successive iterations of 
the neural network process described above yield the same 
complete image or set of data (Figure 8) . 

In the preferred method, the hybridization reactions 
associated with forming the inner products of T^. and U^^ or 
are carried out using concentrations of U^^ and oligomers 
that are 1-4 times as great as the concentration of the T^^ 
oligomers, in order to reduce the time required for the 
hybridization reaction. Similarly^ when applying the 
saturating function to X^, saturation of the oligomer binding 
sites in the array occurs more rapidly when the X^ strands are 
present in stoichiometric excess relative to the complementary 
strands in the array. The preferred method may therefore 
include a step of amplification of the DNA strands 
representing U^^ and X., e.g., by PGR, prior to the steps of 
forming the inner products of T^j and U^^ or X^, or prior to 
applying the saturating function to X^, in order to increase 
the rates of hybridization associated with each of these 
steps. Increasing the copy number of the X^ oligomers by PGR 
amplification, or changing the concentration of X. oligomers 
required to saturate the binding sites on the hybridization 
plate, e.g., by altering the number of complementary oligomers 
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tethered to the pixels of the DNA chip, correspond to 
adjusting the small-signal gain parameter g of the saturating 
function, as in S(x) = g • tanh(x). 

Sample Volume 

Each DNA strand representing an outer product matrix V^^Vj^ 
is of the form {E.}{E,}{F}, and so is 80 nucleotides long when 
the basis vectors are represented by DNA strands of 32 nucle- 
otides (q = 20), and the F oligomers are 16 nucleotides long. 
A sample of memory T^j containing 10 copies of each of 10"^ 
outer product records Vi"" V.^, where each vector V^^ is repre- 
sented by up to 10^ basis vector 80-mer DNA strands, contains 
10 x 10' x (10' x 10') = 10^' DNA strands, and 80 x 10'^^ = 8 x 
10^^ nucleotides. The volume of such a sample of memory T^j for 
which the nucleotide concentration is 1 M is about 1,3 ml. 

Query Time 

The query time of the neural network algorithm is 
determined by the rate at which single stranded DNA oligomers 
representing the memory matrix T^j and the query vectors U^^ and 
Xi of Equations (4) and (5) hybridize to form double-stranded 
DNAs. The rate of hybridization of complementary single- 
stranded DNA molecules is proportional to the total nucleotide 
concentration, and inversely proportional to the amount of 
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unique nucleotide sequence in the DNA, and depends also on the 
ionic strength and temperature [30, 33] . Under a given set of 
temperature and ionic strength conditions, the hybridization 
rate can be accurately predicted when the number of copies of 
each DNA present in the solution are known [30-33] . For a 
solution containing 10' different double-stranded nucleotide 
sequences and having a total nucleotide concentration = 1 M, 
with temperature and ionic strength selected to be favorable 
for hybridization, the time required for one-half of the 
strands to re-hybridize following denaturation is about 580 
seconds, about 10 minutes. 



FEED -FORWARD NETWORKS 

A multilayer feed-forward network with sigmoidal neuron 
response functions [7], for example S(x) = tanh(x), and at 
least one hidden layer of neurons, is able to represent at its 
output any continuous function of its inputs [40] . Such a 
network can be trained on known input-output pairs by the 
back-propagation of errors using an algorithm that is written 
in the language of matrix algebra [7]. A useful neural 
network of this type may thus be implemented with the analog 
vector algebra scheme outlined above; the learning algorithm 
involves multiplication by the derivative of the response 
function. Such a learning algorithm can be implemented by a 
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network with one hidden layer with neuron outputs denoted H, , 
an input layer with outputs and an output layer with 
outputs 0^, wherein the input and hidden layers are connected 
by weights IH^j and the hidden and output layers are connected 
by weights HOi. . The equations for the network are thus 

Hi = S(l^ in,. IJ (7) 

and 

Oi = S(X HO,j HJ (8) . 



The generalized back-propagation learning algorithm [7] is: 

(1) Start with a set of random values for the weights. 

(2) Present the network with a training stimulus having 
a desired 0^^. 

(3) Make changes to the HO weights 

Aho,. = II Ai"^ (9) , 

where 

A,"^ = (0,^ - OJ S' (OJ (10) , 

and where ri is a learning rate parameter. The derivative 
S*(Xi) of the sigmoidal function is proportional to the 
difference S(Xi 4-5) - S(Xi - 5) obtained by adding and 
subtracting a constant 5 from all the components of before 
implementing the sigmoidal function S, The IH weights are 
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changed according to 



AlH,3 = n A,^" Ij 



(11) 



where 



(12) 



The back-propagation algorithm is not very efficient, but 
constitutes a proof of principle for implementing a DNA 
version of a multilayer feedforward network- 
Combined with new rapid techniques for the interconver- 
sion of digital data and analog DNA information, it should be 
possible to construct a DNA neural network having a cycle time 
of the order of an hour, and connectivity of the order of a 
few percent of the number of synapses of the human brain. 
While such a network could be imitated using a digital silicon 
super computer, one of the advantages of the molecular 
approach of the present invention is its small size and low 
price. On the other hand, an analog VLSI representation of a 
neural network on a single silicon chip is limited presently 
to less than about 10^ synapses. The disadvantages of using a 
non-solid state molecular computing device, for example, the 
time required to carry out the selection and synthesis of the 
necessary oligomers, and for biochemical steps such as 
separation of single-stranded from double-stranded oligomers, 
will disappear upon serious development of the various 
techniques involved in its implementation. 
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Baum has proposed using DNA operations such as those 
taught by Adleman [1] and Lipton [3] to produce an associative 
DNA memory of enormous capacity [11], as noted above. While 
the neural network design of the present invention would not 
exceed this capacity when used as an associative memory, it 
could also be configured in more general architectures for 
solving problems of prediction and classification [7, 40] . 
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WE CLAIM: 

1. A method for analog representation of the amplitudes 
of a vector, 

wherein single-stranded oligomers and E^ are a subset 
of all single-stranded oligomers and are each in 1:1 
correspondence with the basis vectors e., i = l,2,...,m in an 
abstract m-dimensional vector space; 

wherein a set of the oligomers E. and E^ represents an m- 
component vector V = e^, wherein the E, and E^ oligomers 

have complementary nucleotide sequences, with the E^ oligomers 
representing the i-th component of V for which the amplitude 
Vi is positive, and the Ei oligomers representing the i-th 
component of V for which Vi is negative; and 

wherein the concentration of each of the oligomers E, or 
E^ is proportional to the magnitude of the amplitude of the 
i-th component of V. 

2. The method of claim 1, wherein the oligomers 
independently comprise subunits selected from the group 
consisting of deoxyribonucleot ides , ribonucleotides, and 
analogs of deoxyribonucleot ides or ribonucleotides; and any 
single oligomer comprises one or a combination of two or more 
of said different types of subunits. 
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3. A method for analog representation of the operations 
of vector and matrix algebra, 

wherein each vector is represented by a set of the 
oligomers E, and E. according to claim 1, and 

wherein the operations of vector addition and vector and 
matrix algebra are represented by biochemical processes and 
reactions involving said oligomers E^ and E^, comprising 
diffusion, molecular recognition, specific hybridization of 
complementary oligomers, and sequence-specific reactions of 
nucleic acid-modifying enzymes acting on the oligomers. 

4. The method of claim 3, wherein the oligomers 
independently comprise subunits selected from the group 
consisting of deoxyribonucleotides , ribonucleotides, and 
analogs of deoxyribonucleotides or ribonucleotides; and any 
single oligomer comprises one or a combination of two or more 
of said different types of subunits. 

5. A method for implementing an analog neural network, 
wherein data of each neuronal unit, in the form of m- 

component vectors V = 2, V. e^, are each represented by a set 

of the oligomers E. and E^ that are a subset of all single- 
stranded oligomers and are each in 1:1 correspondence with the 
basis vectors e^, i = 1,2,..,, m, in an abstract m-dimensional 
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vector space; 

wherein a set of the oligomers E, and E. represents an m- 
component vector V = 2. V, e,, wherein the E^ and Ei oligomers 

have complementary nucleotide sequences, with the E^ oligomers 
representing the i-th component of V for which the amplitude 
Vi is positive, and the E. oligomers representing the i--th 
component of V for which is negative; and 

wherein the concentration of each of the oligomers E^ or 
E. is proportional to the magnitude of the amplitude of the 
i-th component of V; 

wherein the interconnections and signaling between 
neuronal elements are represented by a set of biochemical 
reactions involving the oligomers E. or E^ that are analog 
representations of operations of vector addition and vector 
and matrix algebra; and 

wherein application of a saturating function to a signal 
from one or more neuronal units to produce an output is 
represented by hybridization of a set of oligomers selected by 
said set of biochemical reactions to a complete, sub- 
stoichiometric set of single-stranded E, and E. oligomers, and 
an output of the neural network is represented by a set of 
oligomers that specifically hybridize to said sub- 
stoichiometric set of E^ and E. oligomers. 
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6. The method of claim 5, wherein the oligomers 
independently comprise subunits selected from the group 
consisting of deoxyribonucleotides , ribonucleotides, and 
analogs of deoxyribonucleotides or ribonucleotides; and any 
single oligomer comprises one or a combination of two or more 
of said different types of subunits. 

7. The method of claim 5, wherein a content 
addressable memory is represented by a pool of oligomers 
having selected sequences, and a subset of the oligomer 
strands representing a particular experience V^^ is used to 
obtain the set of oligomer strands representing the complete 
experience V^"", comprising the steps of: 

(a) obtaining single-stranded oligomers representing a set 
of vectors Vi^, each of which vectors represents an item of 
experience, 

(b) storing the items of experience in memory by forming the 
outer product over all the experience vectors V^^ for i^j : 

(c) retrieving a particular experience that is 
imperfectly represented as U^^, in accord with 

Vi = S(X Tij V, + U,^) , 
where the function S{x) is a saturating function, by finding a 
set of oligomer strands corresponding to the inner product 
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of the strands representing the T.. matrix and the strands 
representing vector U^", 

(d) hybridizing the oligomer strands representing to a 
hybridization array comprising a complete set of anchored 
and E, strands, washing the hybridization array to remove 
excess strands, and identifying the depot sites of the 
array that contain double-stranded oligomer complexes, 

(e) denaturing the duplex molecules in the hybridization 
array and collecting the set of oligomer strands S(Xi) 
representing the saturated X^ strands, 

(f ) repeating steps (c) , (d) , and (e) iteratively, using 
the Xi oligomer strands obtained in each previous iteration to 
obtain a new set of strands X^ * representing saturated X^ 
selected from the inner product of the strands representing 
the matrix and the strands representing X^, until two 
successive iterations yield the same set of oligomer strands 
representing the complete experience V^*^. 

8. The method of claim 7, wherein the oligomers 
independently comprise subunits selected from the group 
consisting of deoxyribonucleotides , ribonucleotides, and 
analogs of deoxyribonucleotides or ribonucleotides; and any 
single oligomer comprises one or a combination of two or more 
of said different types of subunits. 
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ABSTRACT 



A method for analog representation of the amplitudes of a 
vector in which a set of single-stranded nucleic acid 
oligomers and represents each m-component vector 
V = 2, e,, where Ei and are each in 1:1 correspondence 
with the basis vectors 1=1, 2,..., m in an abstract 

m-dimensional vector space. The E^ and E. oligomers have 
complementary sequences, and represent the i-th component of V 
for which the amplitude is positive and negative, 
respectively. The concentration of each of the oligomers E^ 
or Ei is proportional to the magnitude of the amplitude Vi of 
the i-th component of V. The oligomers independently comprise 
subunits selected from the group consisting of deoxyribonucle- 
otides, ribonucleotides, and analogs of deoxyribonucleotides 
or ribonucleotides, and any single oligomer can comprise one, 
or a combination of two or more, of said different types of 
subunits. The invention also includes methods for analog 
representation of the operations of vector addition and vector 
and matrix algebra that are implemented using vectors that are 
represented by sets of oligomers Ei and Ei as described above. 
The invention further includes a method for implementing an 
analog neural network, for which the data of each neuronal 
unit is represented by a set of oligomers Ei and Ei as 
described above; and interconnections and signaling between 
neuronal units are represented by sets of biochemical 
reactions that are analog representations of operations of 
vector and matrix algebra as described above. Application of 
a saturating function to a signal from one or more neuronal 
units to produce an output is represented by hybridizing a set 
of oligomers selected by such a set of biochemical reactions 
to a complete, sub-stoichiometric set of single-stranded Ei 
and Ei oligomers, and an output of the neural network is 
represented by a set of oligomers that specifically hybridize 
to such a sub-stoichiometric set of Ei and Ei oligomers. 
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